Using generalized additive models to reduce residual confounding.

Authors

  • Andrea Benedetti
  • Michal Abrahamowicz
Abstract

Traditionally, confounding by continuous variables is controlled by including a linear or categorical term in a regression model. Residual confounding occurs when the effect of the confounder on the outcome is mis-modelled. A continuous representation of a covariate was previously shown to result in a less biased estimate of the adjusted exposure effect than categorization provided the functional form of the covariate-outcome relationship is correctly specified. However, this is rarely known. In contrast to parametric regression, generalized additive models (GAM) fit a smooth dose-response curve to the data, without requiring a priori knowledge of the functional form. We used simulations to compare parametric multiple logistic regression vs its non-parametric GAM extension in their ability to control for a continuous confounder. We also investigated several issues related to the implementation of GAM in this context, including: (i) selecting the degrees of freedom; and (ii) alternative criteria for inclusion/exclusion of the potential confounder and for choosing between parametric and non-parametric representation of its effect. The impact of the shape and strength of the confounder-disease association, sample size, and the correlation between the confounder and exposure were investigated. Simulations showed that when the confounder has a non-linear association with the outcome, compared to a parametric representation, GAM modelling (i) reduced the mean squared error for the adjusted exposure effect; (ii) avoided inflation of the type I error for testing the exposure effect. When the true confounder-outcome relationship was linear, GAM performed as well as the parametric logistic regression. 
When modelling a continuous exposure non-parametrically, in the presence of a continuous confounder, our results suggest that assuming a linear effect of the confounder and focussing on the non-linearity of the exposure-outcome relationship leads to spurious findings of non-linearity: joint non-linear modelling is necessary. Overall, our results suggest that the use of GAM to reduce residual confounding offers several improvements over conventional parametric modelling.
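The residual confounding described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' simulation design: the data-generating model, sample size, knot locations, and penalty value are all assumptions chosen for illustration, and a penalized piecewise-linear regression spline is used as a simple stand-in for a GAM smoother. The true exposure effect is set to zero, the confounder acts on the outcome through its square, and the exposure is correlated with that squared term, so a linear adjustment should leave a spurious exposure effect while the flexible adjustment should largely remove it.

```python
import numpy as np

def fit_logistic(X, y, penalty=None, n_iter=30):
    """Ridge-penalized logistic regression via Newton-Raphson; returns coefficients."""
    k = X.shape[1]
    # A tiny default ridge keeps the Hessian invertible without shrinking estimates.
    pen = np.full(k, 1e-8) if penalty is None else np.asarray(penalty, dtype=float)
    beta = np.zeros(k)
    for _ in range(n_iter):
        eta = np.clip(X @ beta, -30.0, 30.0)
        p = 1.0 / (1.0 + np.exp(-eta))
        w = p * (1.0 - p)
        grad = X.T @ (y - p) - pen * beta
        hess = X.T @ (X * w[:, None]) + np.diag(pen)
        beta = beta + np.linalg.solve(hess, grad)
    return beta

def hinge_basis(z, knots):
    """Piecewise-linear spline basis: one column (z - k)_+ per knot."""
    return np.column_stack([np.maximum(z - k, 0.0) for k in knots])

def exposure_estimates(rng, n=2000, knots=(-2, -1, 0, 1, 2), lam=1.0):
    """Simulate one data set and return the exposure coefficient under
    (a) linear and (b) flexible adjustment for the confounder z."""
    z = rng.normal(size=n)                       # continuous confounder
    x = 0.7 * z**2 + rng.normal(size=n)          # exposure, related to z non-linearly (assumed)
    eta = -1.5 + z**2                            # true exposure effect is zero (assumed)
    y = (rng.random(n) < 1.0 / (1.0 + np.exp(-eta))).astype(float)

    X_lin = np.column_stack([np.ones(n), x, z])
    X_smooth = np.column_stack([X_lin, hinge_basis(z, knots)])
    # Penalize only the spline coefficients, as a penalized smoother would.
    pen = np.r_[np.full(3, 1e-8), np.full(len(knots), lam)]

    b_lin = fit_logistic(X_lin, y)[1]
    b_smooth = fit_logistic(X_smooth, y, penalty=pen)[1]
    return b_lin, b_smooth

def compare_adjustments(n_reps=20, n=2000, seed=0):
    """Average the two exposure estimates over repeated simulated data sets."""
    rng = np.random.default_rng(seed)
    est = np.array([exposure_estimates(rng, n=n) for _ in range(n_reps)])
    return est[:, 0].mean(), est[:, 1].mean()

if __name__ == "__main__":
    lin, smooth = compare_adjustments()
    print(f"mean exposure log-odds ratio, linear adjustment:   {lin:.3f}")
    print(f"mean exposure log-odds ratio, flexible adjustment: {smooth:.3f}")
```

Since the true exposure log-odds ratio is zero in this setup, any average estimate far from zero reflects residual confounding. In practice a dedicated GAM implementation, with data-driven selection of the degrees of freedom, would replace the fixed knots and penalty used here.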

Related articles

Key Words:

SUMMARY Diagnostic residual and partial residual plots are proposed to identify nonlinearity in covariates for generalized linear models. It is shown that these plots complement each other in the detection of nonlinearity. In addition, a transformed additive model which allows the resulting residual and partial residual plots to be obtained through the standard least squares approach is pro...

Genetic analysis of ewe body weight in Lori-Bakhtiari sheep using random regression models

(Co)variance components and genetic parameters for test day ewe body weight of Lori-Bakhtiari sheep were estimated using a random regression model (RRM). The data consisted of 22153 individual body weight records, obtained from 1994 ewes (progeny of 205 sires and 1010 dams) between 371 and 3416 days of age, collected from the flock stud of Lori-Bakhtiari Sheep Breeding Station in Shahrekord, Ir...

Probabilistic Precipitation Forecasting Based on Ensemble Output Using Generalized Additive Models and Bayesian Model Averaging

A probabilistic precipitation forecasting model using generalized additive models (GAMs) and Bayesian model averaging (BMA) was proposed in this paper. GAMs were used to fit the spatial-temporal precipitation models to individual ensemble member forecasts. The distributions of the precipitation occurrence and the cumulative precipitation amount were represented simultaneously by a single Tweedi...

Predicting the Potential Habitat Distribution of Crataegus Pontica C. Koch, Using a Combined Modeling Approach in Lorestan Province

Habitat degradation is one of the important reasons for plant species extinction. Modeling techniques are widely used for identifying the potential habitats of different plant species. Thus, the purpose of the current study was to determine potential habitats of Zalzalak in Lorestan Province. Species presence data and 23 environmental variables were collected in Lorestan Province. Correlation analysis ...

The severity of the relationship between daily air pollution and cardiovascular deaths in Ahvaz, Iran- using generalized additive models (GAMs) for seven years during March 2008 - March 2015

Abstract Background and objectives: Some epidemiological evidence has shown a relationship between environmental air pollution and adverse health effects. The aim of this study was to evaluate the effect of daily air pollution on daily cardiovascular mortality in Ahvaz city. Materials and Methods: In this ecological study, air pollution data were obtained from the Ahvaz Environmental Protectio...

Journal:
  • Statistics in medicine

Volume 23, Issue 24

Pages: -

Publication date: 2004